Learning with Value-Ramp
Authors
Abstract
We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
Similar resources
Learning and process improvement during production ramp-up
Rapid product lifecycles and high development costs pressure manufacturing firms to cut not only their development times (time-to-market), but also the time to reach full capacity utilization (time-to-volume). The period between completion of development and full capacity utilization is known as production ramp-up. During that time, the new production process is ill understood, which causes low ...
Analysis of an Adaptive Iterative Learning Algorithm for Freeway Ramp Flow Imputation
We present an adaptive iterative-learning-based flow imputation algorithm to estimate missing flow profiles in on-ramps and off-ramps using a freeway traffic flow model. We use the Link-Node Cell Transmission Model to describe the traffic state evolution in freeways, with on-ramp demand profiles and off-ramp split ratios (which are derived from flows) as inputs. The model-based imputation algor...
Ramp loss linear programming support vector machine
The ramp loss is a robust but non-convex loss for classification. Compared with other non-convex losses, a local minimum of the ramp loss can be effectively found. The effectiveness of local search comes from the piecewise linearity of the ramp loss. Motivated by the fact that the ℓ1-penalty is piecewise linear as well, the ℓ1-penalty is applied for the ramp loss, resulting in a ramp loss linea...
Structured Ramp Loss Minimization for Machine Translation
This paper seeks to close the gap between training algorithms used in statistical machine translation and machine learning, specifically the framework of empirical risk minimization. We review well-known algorithms, arguing that they do not optimize the loss functions they are assumed to optimize when applied to machine translation. Instead, most have implicit connections to particular forms of...
The Use of Cooperative Approach in Ramp Metering
To ensure higher Level of Service (LoS) at urban motorways, new traffic control concepts are being applied since in most cases there is no available space for infrastructural build-up. For urban motorways, the mostly used control methods are ramp metering combined with additional control methods like variable speed limit control (VSLC). This paper gives a review of the current ramp metering app...
Journal title:
- CoRR
Volume: abs/1608.03647
Pages: -
Publication date: 2016